Extraction of Procedural Knowledge from the Web
نویسندگان
چکیده
User generatedWeb content includes large amounts of procedural knowledge (also called how to knowledge). This paper is on a comparison of two extraction methods for procedural knowledge from the Web. Both methods create workflow representations automatically from text with the aim to reuse the Web experience by reasoning methods. Two variants of the workflow extraction process are introduced and evaluated by experiments with cooking recipes as a sample domain. The first variant is a term-based approach that integrates standard information extraction methods from the GATE system. The second variant is a frame-based approach that is implemented by means of the SUNDANCE system. The expert assessment of the extraction results clearly shows that the more sophisticated frame-based approach outperforms the term-based approach of automated workflow extraction.
منابع مشابه
Presenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملA Comparison of Expert and Novice Iranian EFL Teachers’ Procedural Knowledge in Iranian Language Institutes and Universities
This study sought to compare Iranian EFL novice and expert teachers regarding their procedural knowledge in Iranian language institutes and universities. A questionnaire was developed based on the literature, the theoretical framework, and the results of a qualitative study. This questionnaire was administered to the whole sample of the study who was 200 Iranian EFL teachers from different gend...
متن کاملData Extraction using Content-Based Handles
In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired by the way a human user scans the page content for specific data. In particular, we use text fea...
متن کاملProceduralization and Transfer of Linguistics Knowledge as a Result of Form-focused Output and Input Practice
This study compared the effects of two types of form-focused tasks on proceduralization and transfer of linguistics knowledge in case of English modals. All participants of the study attended pretests, posttests and delayed posttests. The procedural comprehension and production knowledge were measured through the groups’ performance on a timed dual task test that resembled the context of practi...
متن کاملAnswering Science Questions: Deduction with Answer Extraction and Procedural Attachment
An approach to question answering through automated deduction is advocated. Answers to questions are extracted from proofs of associated conjectures over an axiomatic theory of the subject domain. External knowledge resources, including data and software, are consulted through a mechanism known as procedural attachment. A researcher ignorant of the subject domain theory or its logical language ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012